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Abstract 



The Riemannian Bures metric on the space of (normalized) complex positive 
matrices is used for parameter estimation of mixed quantum states based on 
repeated measurements just as the Fisher information in classical statistics. 
It appears also in the concept of purifications of mixed states in quantum 
physics. Therefore, and also for mathematical reasons, it is natural to ask for 
curvature properties of this Riemannian metric. Here we determine its scalar 
curvature and Ricci tensor and prove a lower bound for the curvature on the 
submanifold of trace one matrices. This bound is achieved for the maximally 
mixed state, a further hint for the statistical meaning of the scalar curvature. 
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I. INTRODUCTION 



Let T> denote the space of complex positive nxn-matrices for a fixed n and T> 1 the 
submanifold of trace one matrices. V 1 represents the space of nondegenerate mixed states 
of a n-dimensional quantum system. The tangent space at g G T> (resp. T> 1 ) consists of 
all Hermitian (traceless) matrices. These manifolds carry the so called Riemannian Bures 
metric g defined by 

9q{X, Y) — ^Tr XG , X,YeT e V, 

where G is the (unique, by the Sylvester-Rosenblum theorem, see ||) solution of gG + Gg = 
Y. It should be mentioned, that g is also well defined on manifolds of all g > of fixed rank, 
but we will deal only with the maximal rank. This Riemannian metric was introduced by 
Uhlmann in generalizing the Berry phase to mixed states, He was led to this metric 

by asking for curves on minimal length purifying a given path of densities. Later on this 
metric appeared also in other contexts, see e. g. 

The restriction of g to the manifold of trace one diagonal matrices, i. e. to the manifold 
of all probability distributions on a n-point set, is (up to the factor 1/4) just the Fisher met- 
ric known from classical statistics, see e. g. |2|||. Similarly to this case the Bures metric is 
related to the statistical distance of quantum states, see P,[T0||. Roughly speaking, both met- 



rics give a lower bound for the variance of an optimal parameter estimator. Thus the Bures 
metric generalizes the classical Fisher information to the quantum case. Among other gen- 
eralization, namely the so called monotone metrics (i. e. metrics decreasing under stochastic 
mappings), the Bures metric is minimal, and it seems to play a distinguished role also 



for other reasons, see |T7|JTq| . Partial results concerning the curvature of the Bogoliubov 



metric, another monotone metric, were obtained in [14 . 

Several authors, e. g. suggested that the scalar curvature has a quantum statis- 

tical meaning as a measure of local distinguishability of states in the sense, that regions of 
small curvature require many measurements for distinguishing between neighboring states. 
But this is still in progress and, up to now, no statistical equation or estimation involving 
the scalar curvature seems to be available. However, we show that the scalar curvature is 
minimal for the maximally mixed state -1 and that it diverges nearby pure states, further 
hints for the suggested statistical meaning. 

We determine here the Ricci tensor and the scalar curvature (Propositions 2 and 3) 
completing the list of basic local curvature quantities of the Bures metric. 
Notations: The eigenvalues of a positive matrix g are denoted by Aj. Thus, if we assume 
g to be diagonal, then g = diag(Xi, . . . , A n ). Bold letters are used for operators acting on 
matrices. They will depend on g, so that they actually represent fields of operators called 
by several authors superoperators. However, we frequently suppress this dependence for 
brevity of notation similarly to vector field and other quantities. In particular, L g and 
denote the operators of left and right multiplication by g and j^p- is the inverse operator 
of L + R (denoted by 1Z~ 1 in [[J). This operator appears in many of the following formulae 
and is a serious obstruction for using coordinates in handling the Bures metric which now 
reads 

1 1 

q — — Tr dg — (do) . 

y 2 L + R 
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However, from the theory of matrix equations, @,[|, some explicit formulae for this operator 
and the metric can be derived, [19 . 



II. THE RIEMANNIAN CURVATURE TENSOR 

In this section we explain some results concerning the Riemannian curvature tensor of the 
Bures metric and introduce on this occasion some further notations. A brief communication 



of this results appeared in [JL1]]. Proofs and more details can be found in ||12|| . 

The manifold T> is an open subset of the space of Hermitian matrices and all tangent 
spaces T g T> are identified with this real vector space. Thus we regard vector fields X,Y, . . . 
on D as functions g h- > X e on T> with Hermitian (traceless for V 1 ) values. The flat covariant 
derivative on T> inherited from the affine structure of the Hermitian matrices is simply the 
derivation along straight lines; (V x Y) g = lim^o (Y e +tx e — Y e )/t. In particular, V X N = X, 
where N is the vector field defined by N e :— g. It is perpendicular w. r. to the Bures metric 
to the submanifold T> 1 . This allows for determining curvature quantities of T) 1 from that of 
T> by the Gauss equation. Quantities with superscript 1 will always refer to T? 1 . 

We denote by V resp. V 1 the covariant derivative of the Levi-Civita connection of the 
Bures metric on T> resp. T> . In []12| it was shown that 

W X Y = V f x Y - ^(X)N^(Y) - j^(Y)N^(X) (la) 
V 1 x Y = V x Y + 2g(X,Y)N, (lb) 

where on the right hand side of (|Ii|) appears the usual product of matrix valued functions. 
Of course, in order to apply flTE| ) one must extend the vector fields X and Y on T* 1 to a 
neighborhood of V 1 , but the result will not depend on this extension. ( |IaD corresponds to 
the well known equation 

V di a j d J =d t (a j )d J + a j T^d kl 

which relates the covariant derivative to the flat derivative induced by a local parametriza- 
tion. The calculations of the next section are based on the following Proposition derived 
from (fl|). 

Proposition 1 : 

The curvature tensor field of the Bures metric on T> resp. V 1 is given by 

U(W, Z, X, Y) := g (v x v Y - v Y v x - v [XiY] ,w) 

= 2^(iLR[ E ^X, E ^r],i[ E ^H/, I ^Z]) 
+ 9 (iLR [j^Z, j^Y] , i W, j^X] ) 
- g (i LR [^Z, ^X] M^W, ^Y] ) , (2a) 

K\W, Z, X,Y)= U (W, Z, X, Y) + g(Y, Z)g(X, W) - g(X, Z)g{Y, W) . (2b) 

□ 



Note the different meaning of commutators in the equations above. In (Pa) it is pointwise 
the usual matrix commutator, [X, Y] g := X Q Y g — Y e X Q . All further commutators will be 
understood in this sense. An immediate consequence of Proposition 1 is 
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Corollary 1: 

Let p be the plane generated by two tangent vectors X and Y . Then the sectional curvature 
is given by 

w v TZ{X,Y,X,Y) 
K{p) ■ 



g(X,X)g(y,Y)-g(X,Y)* 

= ^ LR [lTr*» lTr*] ; j [lTr lTr^] ) 

g(X,X)g(Y,Y)-g(X,Yy 

K\p) = K(p) + 1. 

□ 

Finally we mention that for n = 2 the Riemannian manifold (T* 1 , g) is isometric to an open 
half 3-sphere of radius 1/2, ||. The geometry for n > 2 is much more complicated, e. g. T> 1 



is not locally symmetric, 12 



III. RICCI TENSOR AND SCALAR CURVATURE 

In order to determine the Ricci tensor and the scalar curvature we have to calculate 
traces of the curvature given by Proposition 1. For clarity we will distinguish in the notation 
between the trace of matrices and the trace of operators acting on matrices. We will treat 
simultaneously the normalized and the unnormalized case. For brevity we include in brackets 
additional terms corresponding to the normalized case . 

First we determine the curvature mapping, also denoted by 7Z, which is given by 
g(TZ(X,Y)Z,W) = K(W,Z,X,Y). For this purpose we have to separate W in @ as 
a single argument of g. Using the definition of g and the selfadjointness of L and R w. r. to 
the Hilbert- Schmidt product we obtain after a straightforward calculation 

Tl<»{X i Y)Z= 2[^Z,^[^Y,^X}} 

, r i V LR r i v i all 
Ll+r^-' l+r Ll+r 1 ' L+R^JJ 

+ [ct^ct[lTr^lTr^]] +( 9 (Y,Z)X-g(X,Z)Y). (4) 
The Ricci tensor is defined by 

Ricci(Y, Z) := TV { X i-> K(X, Y)Z } . 
Eliminating X in (|]) yields 

Ricci® (Y, Z) = Tr { 2 ad^Z o ^ o ad^F o ^ 

+ ad L+R" [l+R^' L+R^] ° L+R 

+ ad LTR^ ° ° ad LTR^ ° lTr } +((n 2 -2)g(Y,Z)). (5) 

This equation requires some comments. adV^ denotes the usual commutation operator, 
adV(VT) := [V,W], and we have to do with compositions of operators. The trace should 
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be regarded, originally, on the real tangent spaces, that means on the Hermitian matrices, 
traceless or not. But the normal direction generated by g does not give any contribution to 
the trace in Ricci(Y,Z) because TZ(X,Y)Z vanishes for X Q := g. The additional term in 
the normalized case is the trace of X \— > g(Y, Z)X — g(X, Z)Y on the (n 2 — l)-dimensional 
space of traceless Hermitian matrices. Finally, the trace of a real operator equals the trace 
of its complexification. Therefore we can take the trace in ([5]) on all complex nxn-matrices. 

To continue the determination of the Ricci tensor we notice that the second term of the 
trace in (|5|) vanishes, because Tr adV^o(L + R) _1 = for all V. Indeed, we can suppose 
that g is diagonal. Then 

ad ^ ° lTr = E ^ ^ LTR E ^> = E ^(E. - E iif V) = . 

The remaining expression in (|J) must be symmetric in Y and Z, since the Ricci tensor is 
symmetric. Hence (^) reduces to 

Ricci^(Y, Z) = 3 Tr ad^F o o ad^Z o ^ + ( (n 2 - 2)g(Y, Z) ). (6) 

The Ricci tensor can be represented as Ricci (Y, Z) = g (Y,FR icci (Z)) , where the Ricci 
mapping Fmcd is a field of operators self-adjoint w. r. to the Bures metric and whose trace is 
the scalar curvature. We cannot expect that F^ icci is a simple expression in terms of L and 
R, e. g. like LR(L + R) . Indeed, if q is diagonal we obtain from (0) using the standard 
basis after a simple calculation 



Ricai(Y, Z)-3j2 {Xi + Xj){ £ X f x l {Xk + Xj) l^^ff^ W 



and 



F mca (z) = 6 x; (Ai+Afc )( Afc+A) ^ E « - 6 E j^tx^ z ^ (7b) 

for Y,Ze T g T>. To express the Ricci mapping for a general g we need the following natural 
mappings: 

m,m G : A® A — > A, A : A — >A®A, A := M nxn (C) , 

where m is the usual multiplication, m the opposite multiplication, m (X <%> Y) — YX, 
and A the comultiplication. It is the dual of m if we identify A and A* via A i— > (A, ■ ). 
Explicitly, 

A(Ejj) = E E ^® E fei- 

k 

It is obvious that these mappings are equivariant w.r. to the adjoint action of the unitary 
group, e. g. A(uXu*) = (Adu ® Adu) A(X). Using these mappings we have: 
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Proposition 2: 



RicciS 1 \Y,Z) = g{Y,¥^ icci {Z) 



where 



Ricci 



6(m 



™o) o (ft 



L+R 



L+R 



LR 

L+R 



)oAoj^ + ((n 2 -2)Id 



(8a) 



(8b) 



Proof: We prove the unnormalized case, the additional term in the normalized one is clear 
from (H). If g is diagonal the last equation follows by comparing (|8|) with fl7a|) . For general g 
it is sufficient to remark that the right hand side of ( jSb|) is a (l,l)-tensor field on T> invariant 



under the U(n)-conjugation. This implies the invariance of the right hand side of (5a). □ 

Now we proceed with the scalar curvature S = TrF^ CC j. Again, the normal direction 
does not give a contribution to the trace and we can take it on all complex matrices. We 
will use some obvious algebraic relations between the multiplication operators, e. g. 

m o (L <g) Id) = L o m o (Id <g> Id) , 

m o (R <g> Id) = m o (Id <g> L) , 
m Q o (R <g> Id) = R o m Q o (Id <g> Id) , 
m G o (L <g> Id) = m o (Id <g> R) , 



and obtain from (EE) 



S — Tr F fficc j 

= 6 Tr (L + R) o |mo (R®Id) -m G o (Id®R)| o (^g,^) o A o ^ R 
= 6 Tr |m o (R®Id) - m G o (Id®R) } o (^g^) o A . 



(9) 



The evaluation of this trace yields: 

Proposition 3: The scalar curvature on T> resp. T* 1 equals 



= 6 Tr g 



X' g ( 



-^Tr^ 1 + 



n 



l)(n 2 



Trh e (g) +((n 2 -l)(n 2 -2)) 



(10a) 
(10b) 



where Xq i> s the characteristic polynomial of g, x' Us derivative and h e the function given by 



h e (t) :=6t(lr — 



g + tl 



3 
2t 



Remark: Xq{~ £?) zs > f ac -t> invertible since Xs(~t) — YLi^i + implies Xg(~^j) > for 
all eigenvalues. □ 

Proof: It is sufficient to prove the assertion for diagonal g. For such g it is easy to calculate 
the trace (|9|) and we obtain 
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This is in accordance with formulae ([10|) . The additional term in the normalized case is 
obvious by (|8T5|) . □ 
The scalar curvature depends only on the invariants of g. In order to express it in terms 
of invariants we introduce the following matrix depending on g: 

{1 for i + 1 = j 

(-l) n - j e n+1 ^j for i = n 
otherwise 

where is the elementary invariant of degree i of g, i. e. x(t) — J2i=o e n-i{— t) 1 - Since 
E e has the same characteristic polynomial as g both matrices are conjugate provided the 
eigenvalues of g are different. Thus, at least for such points, we get from Proposition 3 



Corollary 2: 



S« = 6Tr£ ^L^-^5-i +(V-l)(n 2 -2) 
= Tr + ( {n 2 - l)(n 2 - 2) V where 

M*) : =6t(lr— *— V--. 



Since the set of g with different eigenvalues is dense, the Corollary is true for all points by 
continuity of the curvature. 

A further consequence of Proposition 3 is the following lower bound for the scalar cur- 
vature in the normalized case: 



Corollary 3: 



t (5n 2 -4)(n 2 -l) 
S °~ 2 



For n > 3 equality holds iff g = -1. For n = 2 the scalar curvature equals 24 for all g. 
Proof: The eigenvalues of g G T> 1 satisfy Aj = 1 and we have 



Aj + Afc / 4 ^-^ Afe ^-^ V z --' Aj + Afc / -f— ' Aj + A& 

K i k k 1 t=£k 

> f V Xk V V 1 > rc 2 fo-l) 2 n 2 (n- 1) = n 2 (n 2 - 1) 
_ ^^ + Aj ^Ai + A fc - 4 2 4 

Here we used the Schwartz inequality, the relation 

EAfc ^ Afc n n 2 n n(n — 1) 
. A~TA^ = A~TaI ~ 2 = Y ~ 2 = 2 

if^k i,k 
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and the fact that the arithmetic mean of all l/(Aj + A&), i 7^ k, is greater than or equal to 
the harmonic mean which equals n/2. Hence, equations ( |10|) and (|Tl|) imply 



„i 3 9/9 , , 9 w9 , (5n 2 — 4)(n 2 — 1) 
S, 1 > 2 "V - 1) + (" 2 - 1)(" 2 - 2) = - f 1 ■ 

Moreover, the bound is achieved for g = -1. Finally we note that for n = 2 the above 
estimations are, in fact, equations (S l = 24). For higher n this can hold only iff all A, + A&, 
i 7^ fc, are equal, i. e. iff Aj = 1/n. Hence, g = ^1 is the only minimal point. □ 
There is no upper bound for n > 2. Indeed, by (|TTD the scalar curvature equals up 
to a constant the sum of all 6Afc/((Aj + A^)(A^ + Aj)), where not all indices are equal. 
Therefore, S 1 tends to infinity iff e n -\ tends to zero, because e n _i is the sum of all A^ . . . , 
i\ < i 2 < ■ ■ ■ < i n -i- Roughly speaking S 1 diverges if we get close to density matrices of 
rank k < n — 1, in particular, if we get close to a pure state. 

Example: We consider the scalar curvature on T> 1 for n = 3 using Corollary 2: We have 
to set ei = 1. Then 

/ 1 0\ 

X (t) = -t 3 + t 2 - e 2 t + e 3 , S = 1 , 

V e 3 -e 2 1 / 

fe 3 1 \ / -e 2 -2 -3 \ 

*H9 = 2 e 3 e 3 -e 2 1 , x '(-f) = -3e 3 2e 2 -5 , 

\ e 3 e 3 - e 2 1 + e 3 - e 2 y y -5e 3 5e 2 - 3e 3 2e 2 - 5 J 

and we obtain 

S 1 = 6 Tr £ X '(-£)\(-£)- 2 - - Tr £^ + 56 = 2 28e 3~49e 2 -9 

2 e 3 - e 2 



Similarly we get for n 



= 4: 



cl _ 63 e 4 + 35 e| - 43 e 2 e 3 - 7 e 3 - 3 e 2 

c> — o ^ 

e 4 + - e 2 e 3 
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